Tag
4 articles
OpenAI reveals new defenses against prompt injection attacks and social engineering in ChatGPT, strengthening AI agent security through constrained workflows and enhanced data protection.
OpenAI has released IH-Challenge, a training dataset designed to teach AI models to reliably prioritize trusted instructions over untrusted ones, strengthening defenses against prompt injection attacks.
OpenAI introduces IH-Challenge, a training method that improves the instruction hierarchy in frontier LLMs, enhancing safety, steerability, and resistance to prompt injection attacks.
Learn to implement Lockdown Mode and Elevated Risk labels in AI chat interfaces to defend against prompt injection and data exfiltration, modeled on OpenAI's new security features.